Cross-synthesis Based on spectrogram Factorization
نویسنده
چکیده
Spectrogram factorization techniques decompose a sound into a set of characteristic spectral shapes and a set of corresponding temporal evolutions. This can be exploited for a cross-synthesis-like processing by combining the spectral shapes of one sound with the temporal evolutions of the other. A system is proposed that implements such a task in an unsupervised way by means of a comparison of the involved spectral shapes in terms of timbral similarity, and a phase reconstruction algorithm for the resynthesis. The system enables cross-synthesis at the level of intra-note resonances, transients or temporalities. Some illustrative sound generation examples will be presented and discussed.
منابع مشابه
A framework for music analysis/resynthesis based on matrix factorization
Spectrogram factorization is a recent and promising alternative to sinusoidal or source/filter modeling for analysis/resynthesis systems aimed at musical creation. This paper presents a framework designed to perform a wide range of sound manipulations based on Non-negative Matrix Factorization (NMF), including a set of new techniques for creating artificial cross-components not present in the o...
متن کاملAutomatic Glottal Inverse Filtering with Non-Negative Matrix Factorization
This study presents an automatic glottal inverse filtering (GIF) technique based on separating the effect of the glottal main excitation from the impulse response of the vocal tract. The proposed method is based on a non-negative matrix factorization (NMF) based decomposition of an ultra short-term spectrogram of the analyzed signal. Unlike other state-of-theart GIF techniques, the proposed met...
متن کاملSpectrogram Factorization Using Phase Information
Spectrogram factorization methods have been proposed for single channel source separation [1–4], audio analysis [5–8] and more recently multichannel source separation [9–11]. All spectrogram factorization approaches incorrectly assume that the mixture spectrogram is the sum of the source spectrograms. In fact, the mixture spectrogram depends on the source spectrograms and the phase difference b...
متن کاملSpeech Enhancement Using Non-Negative Spectrogram Models with Mel-Generalized Cepstral Regularization
Spectral domain speech enhancement algorithms based on nonnegative spectrogram models such as non-negative matrix factorization (NMF) and non-negative matrix factor deconvolution are powerful in terms of signal recovery accuracy, however they do not directly lead to an enhancement in the feature domain (e.g., cepstral domain) or in terms of perceived quality. We have previously proposed a metho...
متن کاملLow-Rank Time-Frequency Synthesis
Many single-channel signal decomposition techniques rely on a low-rank factorization of a time-frequency transform. In particular, nonnegative matrix factorization (NMF) of the spectrogram – the (power) magnitude of the short-time Fourier transform (STFT) – has been considered in many audio applications. In this setting, NMF with the Itakura-Saito divergence was shown to underly a generative Ga...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2013